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We present estimates of the N-point galaxy area-averaged angular correlation 
functions CJN{d) for = 2,. ..,7 from the third data release of the Sloan Digital 
OO \ Sky Survey (SDSS). The sample was selected from galaxies with 18 < r < 21, 

^ \ and is the largest ever used to study higher-order correlations. The measured 

^ \ ^^n{0) are used to calculate the projected, sn, and real space, Sn, hierarchi- 

\Q I cal amplitudes. This produces highly-precise measurements over 0.2 to 10 h"^ 

^ I Mpc, which are consistent with Gaussian primordial density fluctuations. The 

' measurements suggest that higher-order galaxy bias is non- negligible, as defin- 

^ I ing bi = 1 yields C2 = —0.24 ± 0.08. We report the first SDSS measurement 

^ I of marginally significant third-order bias, C3 = 0.98 ± 0.89, which suggests that 

^ I bias terms may be significant to even higher order. Previous measurements of 

C2 have yielded inconsistent results. Inconsistencies would be expected if differ- 
ent data sets sample different galaxy types, especially if different galaxy types 



H ' exhibit different higher-order bias. We find early-type galaxies exhibit signifi- 

ed ' 

cantly different behavior than late-types at both small and large scales. At large 
scales (r > 1 h^^ Mpc), we find the for late-type galaxies are lower than 
for early-types, implying a significant difference between their higher-order bias. 
We find bi^eariy = 1.36 ± 0.04, C2,eariy = 0.30 ± 0.10, bijate = 0.81 ± 0.03, and 
C2Me = —0.70 ± 0.08. Our results are robust against the systematic effects of 
reddening and seeing. The latter introduces minor structure in ljn{9). 

Subject headings: cosmology: observations — large-scale structure of universe 
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1. Introduction 

A significant portion of our cosmological knowledge comes from studying the statistics 
of the density field traced by observable structure — most often the temperature anisotropics 
of the cosmic microwave background (CMB) or the distribution of galaxies. Many of these 
studies use the two-point correlation function (2PCF). This is largely due to the fact that 
this statistic is relatively easy to calculate and model. The 2PCF completely characterizes 
a Gaussian density field, as odd higher-order moments vanish and even moments can be 
entirely expressed in terms of the two-point function. Using first-order linear perturbation 
theory (LPT) one can show that a Gaussian primordial distribution of density fiuctuations 
will evolve under gravity into a distinctly asymmetric density field. The observed higher- 
order moments of the local density field are predicted to display hierarchical scaling with 
constant hierarchical amplitudes, Sn (see equation 4, or Peebles 1980). 

In the dynamical range where structure grows linearly or quasi-linearly (i.e. larger 
than the typical size of clusters) the Sn are independent of cosmic epoch. Thus, when 
using galaxies to measure Sn, any departure from the stated hierarchy can be due only to 
a non-Gaussian primordial density field or scale dependent bias between galaxies and the 
underlying density field. Further, LPT predicts specific values for the Sn and relates these 
values to the measured Sn for galaxies and the bias. Thus, measuring the higher-order 
correlation functions of the local density field to high precision can constrain the conditions 
of the primordial density field and the bias between galaxies and the underlying matter 
distribution. 

Early measurements of Sn suggested that the amplitudes are non-zero and support the 
hierarchical model on small scales (e.g., Groth & Peebles 1977; Saunders et al. 1991; Bouchet 
et al. 1993; Gaztanaga 1994). More recent measurements endeavor to constrain Sn on larger 
scales: Using just over 200,000 galaxies from the 2dF Galaxy Redshift Survey (2dFGRS) at 
a median redshift of 2; ~ 0.11, Croton et al. (2004) found (after removing two major super- 
clusters from their field) approximately constant Sn for < 5 to 10 h^^ Mpc. Most recently, 
using just under 651,000 galaxies selected from the 2 Micron All Sky Survey (2MASS) with 
Kg <13.5 at a median redshift of z — 0.074, Frith, Outram, & Shanks (2005b) found constant 
S'at for A^ < 7 to 30 Mpc. This result strongly supports Gaussian initial conditions. Their 
measurements, however, also allowed them to determine the second-order bias parameter, C2, 
to be 0.57±0.33 for Kg band galaxies, only barely consistent with zero higher-order modes for 
galaxy bias and of opposite sign to previous results. This measurement was based on their 
previous measurement of bi — 1.39 ± 0.12 using the 2MASS galaxy angular power-spectrum 
(Frith et al. 2005a). 

Separate recent measurements of two-point and three-point correlations using optical 
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surveys have constrained the first- and second-order bias parameters. By calculating the 
redshift space three-point correlation function, Gaztaiiaga ct al. (2005) found bi = 0.94^q}^ 
and C2 = -OMtoll for bj < 19.45 galaxies in the 2dFGRS (ca = ^). Pan & Szapudi 
(2005) found, by studying the monopole contribution to the 2dFGRS three-point correlation 
function, that, for galaxies of absolute magnitudes bj between -21 and -20, bi = I.OA'^q'qI 
and 62 = -0.06+°:°°^ Hikage et al. (2005) found, using the Sloan Digital Sky Survey (SDSS) 
spectroscopic galaxy catalog to calculate the bispectrum, that C2 is consistent with zero to 
within 10%. Comparing early measurements of optical and IRAS galaxies, Pry & Gaztanaga 
(1993) determined that there exists both significant second-order bias and third-order bias 
between the two samples, suggesting that the measured second- and third-order bias terms 
may be dependent on galaxy type (as an infrared survey will certainly select galaxies of 
different spectral properties than an optical survey). 

The measurement of C2 by Prith, Outram, & Shanks (2005b) is somewhat puzzhng 
in light of measurements using optical surveys. The authors offer the explanation that the 
2MASS data set likely contains a much larger fraction of early-type galaxies than typical 
optical surveys. It is thus becoming important to determine the relationship between bias 
and galaxy type. It has been shown (e.g., Willmer et al. 1998; Zehavi et al. 2002; Norberg 
et al. 2002; Madgwick et al. 2003) that early-type galaxies cluster more strongly than other 
galaxy types, leading to a larger bi for early-type galaxies. By calculating S3 for both early- 
and late-type galaxies and comparing the results to those for the full sample of galaxies, 
one can determine the dependence of bi and C2 on galaxy type. Of course, subdividing a 
sample by galaxy type becomes statistically more meaningful in larger data sets, such as 
that provided by the SDSS third data release (DR3). 

Previously, Szapudi et al. (2002) measured third- and fourth-order correlations for galax- 
ies in the SDSS Early Data Release (EDR) and found that these measurements should be 
highly free of any systematics. In this paper, we calculate and analyze the area-averaged an- 
gular correlation functions using SDSS DR3 galaxies, up to seventh-order. Angular surveys 
have several advantages to redshift surveys when calculating high-order correlations: (1) they 
include vastly more objects; (2) redshift distortions are not a problem (see, e.g., (Kaiser N. 
1987; Hamilton A. J. S. 1992)); and (3) they are not susceptible to rare peaks in the galaxy 
density field (see, e.g., Croton et al. 2004; Nichol et al. 2006). These factors, coupled with 
the area and depth of the SDSS DR3 should allow precise measurements oi Sn and their 
dependence on galaxy type. This in turn will allow us to determine C2 and its dependence 
on galaxy type. After discussing our data in §2 and techniques in §3 and §4, we present 
our measurements of the higher-order galaxy correlations for early-type, late-type, and all 
galaxies in §5. In §6 we discuss implications for linear and non-linear biasing schema and its 
dependence on galaxy type. Throughout our analysis, we assume a flat ACDM cosmology 
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(e.g., Spergel et al. 2003). 



2. 



Data 



The data for these measurements were taken from the SDSS DR3 (Abazajian et al. 
2005). This survey obtains wide- field CCD photometry (Gunn et al. 1998) in five passbands 
{u,g,r,i,z] e.g., Fukugita et al. 1996). The completed SDSS should cover ~10,000 square 
degrees. DR3 alone covers just over half of that area (5282 square degrees). We selected 
galaxies with positions lying in the northern, contiguous portion of the SDSS from a DR3 
PhotoPrimary database and further constrained the sample (using the Schlegel, Finkbeiner 
& Davis 1998 dust maps) to have reddening-corrected magnitudes in the range 18 < r < 21. 
Further, significant masking was required to account for bright stars, areas of high reddening 
and poor seeing, (as discussed in detail in section 4.) This produces a set of just over 11 
million galaxies (11,171,958) at a median redshift of about 0.4. This is by far the most 
galaxies used to conduct this measurement, and our depth is much greater than that of 
the recent 2MASS and 2dRGRS measurements. This base data set was split into five sub- 
samples, three with magnitude ranges 18 < r < 19, 19 < r < 20, and 20 < r < 21, a sample 
of early-type {u — r > 2.2) galaxies, and a sample of late-type {u — r < 2.2) galaxies. The 
bulk of our analysis centers on this base data set and its five subsamples. 



We estimate N-point area-averaged angular correlation functions, ujn{6), using a counts- 
in-cells technique (e.g., Peebles 1980; Gaztariaga 1994; Szapudi et al. 2002). We divide DR3 
into cells of equal angular area, so that each cell i contains some number of galaxies (n^). 
The average number of galaxies in a cell is then n. The over density for cell i is thus 



The central moment of the angular counts, mj^{9), is calculated over all of the cells 



3. Methodology 



3.1. Angulcir Correlation functions 
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where 5f{9) and n{9), are for cells with an area that has an equivalent circular radius to the 
angular scale (9) being investigated, nc{9) is the total number of cells at that angular scale, 
and (...) represents a statistical average. The area-averaged N-point functions, uii\f{6), can be 
explicitly calculated from mi\i{9) via the connected central moments, ^n{9) = {(^))c, 
as shown (for < 7) in Appendix A. The subscript c in {S^{9))c refers to the connected 
part of the expectation value, and we include it because for N > 2, {Sf{9))c 7^ (^i^(^)) but 
requires the subtraction of factors of lower order moments. The connected moment would 
equal cun{9) if not for shot-noise error, 5sn, due to the discreteness of the counts-in-cells 
statistic, i.e. 



u;N{e) = {6'^{e)),-5sN (3) 

In a hierarchical model (e.g., Groth & Peebles 1977; Szapudi et al. 1992; Gaztanaga 
1994), higher-order correlations can be expressed in terms of the two-point correlation func- 
tion and the volume-averaged correlations are given by 



UR) = s^URT-' (4) 

where Sn is the hierarchical amplitude. In a similar manner, we can define the analogous 
relationship for the area-averaged angular correlations 

The hierarchical amplitudes of the higher-order moments encode much of the pertinent 
information on the distribution of the data. These amplitudes, therefore, embody the central 
analysis of this paper. 

Cross-correlations between the galaxy density and contaminants that could add system- 
atic errors to our measurement are also calculated. As in Scranton et al. (2002), the cross 
correlation at an angular bin 9a is given by 

^9c{0a)-J^j:^m^ (6) 

where 9fj is unity if the separation between cells i and j is within the angular bin 9^ and 
zero otherwise, subscript c denotes a particular contaminant (e.g., reddening, seeing), and 
N{9a) is just the number of cells within the angular bin 9a- 
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3.2. Inversion to Three Dimensions 

To compare results of angular correlation measurements to the predictions of pertur- 
bation theory and to constrain galaxy bias, it is easiest to deproject the two dimensional 
measurement into a three dimensional measurement. Given the redshift distribution of the 
source population, the angular hierarchical amplitude {sn) can be converted directly to an 
estimate of the real-space amphtude (Sn)- The relationship is (Gaztanaga 1994), 

rNivCNil) 

where f = 2^^ is the mean scale probed for a survey of given depth D at angular scale 6 
-C 1 radian), and 7 is the slope of the real-space correlation function, which we determine 
to be 1.75 (see §5.1). The values Bm{'^) and Cn{'^) were determined by Gaztanaga (1994) 
by considering the number of different configurations of hierarchical tree-graphs. They are 
of order unity, but non-negligible for large N . The remaining factor (r^v) is determined by 
the selection function [ip) for the sample of objects: 

tN-2t 

= -fW^, (8) 
-'2 

poo 

/ V^^xM(^-i)(l + ^)(3+-^)(i-^)F(x)a;2cix, (9) 
Jo 

where x is the comoving distance, e is a factor that describes the evolution of clustering, 
which we take to be zero, and F{x) is a curvature term that is unity for a fiat universe. 

When ^ is known for the projected sample, equation 9 is readily evaluated in redshift 
space. Using the transformations g = x^^, {^)^dx = i^Y^' H)^''^dz, and g = 
where n is the number of objects, c the speed of light, and H{z) the Hubble parameter, 
yields 

= "^"'(1 + ^)(3-7)(i-^) dz. (10) 

We will also wish to determine the relationship between 0)2(6') and i2{R)- Assuming the 
forms (Peebles 1980): 

e2(i?,^) = \i+zr^'+^'^ (11) 

and 

u^{e) = Ae^-'^ (12) 

the constant A is given by 

A = H^hRl (13) 
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where — r(0.5)r(0.57 — 0.5)/r(0.57), V is the gamma function. Given both A and 7 — 1, 
we can derive Ro and thus the form of ^2(-R, z). 

In order to determine we used the photometric redshifts of galaxies from the SDSS 
first-data release (DRl; Abazajian et al. 2003). We construct ^ by using each published 
redshift and its error (rejecting any with error greater than twenty percent) to create a 
probability density function (PDF). The PDFs for each redshift are combined to produce 
the expected number of objects (n) in a redshift bin A2; = 10^^. This distribution can 
then be normalized and interpolated over in order to estimate For SDSS galaxies with 
18 < r < 21, the resulting normalized n{z) is plotted in Figure 1. The distribution of n{z) 
is smooth and roughly Gaussian. Since DRl is a sub-sample of DR3, the redshift selection 
functions are nearly identical. The n{z) plotted in Figure 1 is thus an acceptable estimate of 
the true n{z) for our full sample of galaxies. For our five subsamples (see §2), we likewise use 
the corresponding DRl photometric redshifts to obtain estimates of their redshift selection 
functions. 



3.3. Pixelation 

Tegmark, Xu, and Scranton^, have developed a software package (henceforth SDSSpix) 
that pixehzes the sky in a manner convenient for SDSS data. Working in SDSS A/77 coor- 
dinates (see Stoughton et al. 2002) , SDSSpix creates almost-equal-area, pseudo-rectangular 
pixels over the entire sky. SDSSpix breaks SDSS stripes into a number of rectangles (tran- 
scribed on the surface of the sphere), and hierarchically bisects parent rectangles into in- 
creasingly smaller rectangles (pixels). Thus, to probe any desired angular scale, one only 
needs to bisect the parent SDSSpix rectangles down to the corresponding resolution in the 
SDSSpix equal-area schema. Thus, given information for pixels at the smallest angular scale 
(the base resolution), gaining information at larger angular scales is trivial. Statistical counts 
can be easily, and rapidly, calculated for large angles simply by accurately determining them 
for the smallest scale of interest. 

For this work, the basic methods of SDSSpix were utilized for the selected pixelation 
schemes. SDSSpix pixelates the entire sky, which leads to a burdensome number of pixels at 
small scales. Therefore, at small scales {9 < 0.1 degrees) we reimplemented the algorithms 
used to make equal area pixels over the entire sky, in a manner that took advantage of the 
survey strategy. DR3 is broken into 18 stripes. If one uses the survey A/77 coordinates, each 
stripe makes a rectangle that is 2.5 degrees wide in the 77 coordinate and typically between 



^http://lahniu.phyast.pitt.edu/'~scranton/SDSSPix/ 
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90 and 110 degrees in the A coordinate. Since each stripe has a unique range in A, the SDSS 

DR3 essentially consists of 18 data subsections. Each stripe was thus given its own set of 
pixels based on its A coordinate, meaning that the majority of the sky, not covered by DR3, 
could quickly be discarded. The correlations were measured for each stripe, with the full 
measurement being the composite average over all stripes, inverse-variance-weighted by the 
number of objects in that stripe. 

For larger angular scales {6 > 0.1 degrees), averages over the entire sample are required 
to obtain an accurate result. On these scales, pixelating the entire sky is less computationally 
challenging, and SDSSpix was directly used for these scales. As a result, the correlations were 
measured at these scales over the entire DR3 area. We have verified that the results using 
the striped method are consistent with using the default SDSSpix implementation at the 
scale of 0.1 degrees. Prom here on, we will refer to the method used for small angular scales 
as the striped method and to the method for large angular scales as the SDSSpix method. 

3.4. P(n) plots 

For each resolution we measured the probability of finding a cell with n counts in it, 
P{n). This serves to justify the pixelization scheme used. Ideally, these plots show a smooth 
curve centered at the most probable count, as this signifies sufficient sampling of the data. 
Figure 2 shows the P{n) plots for resolutions that correspond to angular scales between 0.02 
and 5.2 degrees. The P{n) become less smooth as the angular scale gets larger but appear 
sufficient for the purposes of our measurements to an angular scale of about 1.3°. At a 
scale of 2.6°, Figure 2 suggests that a measurement will have a large associated statistical 
error. At a scale of 5.2°, it appears unlikely that any information can be determined to any 
precision. This is due to the fact that we use non-overlapping cells in our analysis and thus 
have few pixels at this large of a scale. As a result, we expect to be able to make precision 
measurements to an angular scale of ~ 1°, and relevant measurements to an angular scale 
of ~ 3°. 



3.5. Errors and Covariance 

We compute errors and covariance matrix using a jackknife method (see, e.g., Scranton 
et al. 2002; Myers et al. 2003, 2005). To properly constrain fit parameters, we calculate 
covariance matrices by following the prescription of Myers et al. (2006b), which entails 
splitting the data into N subsamples and calculating the covariance between angular bins 9i 
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and 6j as 

= <^[<yi, lyj) = =;]v — 2 ^^'^^ 

where u)fe(^) is the value for the correlation measurement omitting the fcth subsample of 
data, and the number of galaxies in subsample A^, Ugai, is used to inverse- variance weight 
the contribution of that subsample to the covariance. The jackknife errors, cxj can be obtained 
from the diagonal elements of the covariance matrix 

= (15) 

This jackknife technique is used to calculate the covariance for every order correlation. We 
estimate fits to model functions {cUm) using the inverse of the covariance matrix. 

= YlP{e^) - u;m{e^)]c-'p{e,) - u^{e,)] (i6) 

All of the averages and power-law fits conducted throughout the rest of the paper are ac- 
complished by minimizing the defined in Equation 16. 

For the striped method, we utilize the natural geometry of the SDSS. Each of the 18 
different stripes in the DR3 forms a natural subset of the overall data. The the covariance 
matrix is thus calculated using each of the possible subsamples of DR3 that is made up of 
17 stripes. For the angular scales that we utilize the striped method, we find that these 18 
subsamples are sufficient to create a stable covariance matrix. For the larger angular scales 
probed by the SDSSpix method, we find that many more jackknife subsamples are necessary. 
Our covariance matrix is not stable until we utilize at least 100 jackknife subsamples. We 
therefore use 150 jackknife subsamples for all calculations involving the SDSSpix method. 

4. Masks 

We will generally refer to useful observational information (such as seeing and Galactic 
extinction values) across each pixel in our schema (see §3.3) as forming a mask of that 
information. The DR3 area required significant masking to be statistically useful. For 
instance, any pixels at the base resolution of our schema that intersect the standard SDSS 
imaging masks (which cover areas missing from DR3 due to, for example, satellite trails. 
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charge transfer, bright objects^) are discarded (masked) from our analyses. During our tests 
of stripe variation, we found that a mask for galaxy MlOl {a = lA^ 03™ 12*, 5 = 54 21' 00") 
was not included in the standard DR3 imaging masks, and thus we also mask an area of 
~ 0.25 square degrees around the center of MlOl. We further discard pixels based on their 
mean value of seeing or reddening, as will be described in the the subsequent sections. 

Pixels are masked out at the base resolution of the pixelization scheme (the smallest 
angular scale that we calculate, see §3.3). For pixels above the base resolution, the fraction 
of the area of the pixel that is unmasked is calculated using the base pixels. The over density 
(previously defined by Equation 1 in §3), 5i in pixel i becomes 

n - ^ 

Si = —r^ (17) 
n 

where Aj is the fractional area of cell i. This allows for a complete mask that is consistent 
on all angular scales. 



4.1. Reddening 

As shown by Scranton et al. (2002), including areas of high Galactic obscuration (red- 
dening), can alter statistical counts for extragalactic objects. We therefore tested to see how 
this Galactic reddening affected our analyses. We concentrated our analysis on the large 
angle measurements since it has been shown (e.g., Scranton et al. 2002, Myers et al. 2006) 
that reddening affects correlation measurements on scales larger than ~ 0.1 degrees. In order 
to determine how reddening affects the eventual hierarchical amplitudes, we first tested the 
result of the measurement of S3 for galaxies with 18 < r < 21 and g^-band reddening cuts at 
Ag < 0.35, 0.3, 0.25, 0.2, and 0.15, as shown in Figure 3. At any angular scale, the effect of 
changing the reddening cut appears to be negligible. Figure 4 shows the relationship between 
the measured S3 at 1.3°, and the reddening cut, between Ag < 0.5 and Ag < 0.175. There 
is a slight trend for the amplitudes to decrease, but this is negligible given the size of the 
error bars. Based on the SDSS EDR, Scranton et al. (2002) determine a reddening cut of 
Aj. < 0.2 (~ Ag < 0.3) to be appropriate. In Figures 3 and 4, the values at this reddening 
cut have small errors compared to both less and more restrictive cuts. Thus, we also use a 
reddening cut at ^4^ < 0.3 throughout our analyses. 



^http://www.sdss.org/dr3/products/iniages/use_masks.html 
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4.2. Seeing 

We have also tested the effects of seeing in our correlation measurements. The hierar- 
chical amplitude S3 is shown in Figure 5 for seeing cuts at 1'.'4, 1'.'5, 1'.'6, l'/7. 1"9, and 2'.'2, 
for galaxies with 18 < r < 21. There are noticeable differences in the value of S3 depending 
on the seeing cut. The error on the measurement appears to be minimized at a seeing cut 
of I'.'S for the majority of the angular scales. 

To further quantify the effect of seeing in the correlation measurements, the cross- 
correlation between galaxy density and the mean seeing per pixel was determined for galaxies 
with 18 < r < 21 for seeing cuts between 1'.'9 and l'/4. This data is presented in Figure 6. 
The ideal seeing cut is one that produces a cross-correlation that is consistent with being 
fiat and with zero. None of the seeing cuts produce results that are flat, but the seeing cut 
at I'.'S produces a correlation that is most consistent with zero at large scales. The fact that 
the seeing-galaxy cross correlation increases at small scales should not be a problem, because 
this increase is insignificant in comparison to the amplitude of the galaxy auto- correlation 
at these scales. Thus, a seeing cut at I'.'S is used throughout our analyses. 

4.3. Variation Between Stripes 

As a final systematic test, we looked at the variation in our correlation measurements 
as a function of the SDSS stripe. Some variation between stripes was expected, but any 
stripes whose values deviated seriously from the mean could signify a problem with the data 
in that stripe. Figure 7 shows the measured uj2{d) and S3(^^) for galaxies with 18 < r < 21, 
measured in each individual stripe used for this work. There are no significant differences 
between stripes, and thus no one stripe significantly alters the outcome of our measurements. 
We repeated this analysis for each of our five subsamples (see §2), and found that no one 
stripe significantly altered the outcome of any measurements. As a result, we use all 18 
stripes in our analysis. We note that this would not have been the case had we not masked 
the area around MlOl from our analysis (see above). 

5. Results 

5.1. Area-averaged Correlation Functions and Hierarchical Amplitudes 

Figure 8 shows the area-averaged correlation functions for < 7 determined for galaxies 
with 18 < r < 21. Errors for each point were determined by the jackknife method (Equations 
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14 and 15). For each N, the correlation function has a shape consistent with a power-law, 

but for N > 2 errors dominate at scales large than ~ 1°. Figure 8 displays the result using 
both the striped and the SDSSpix method between about 0.08 and 0.4 degrees showing the 
two methods are in agreement. Assuming a form U2 = A6^~'^, the best model fit for N = 2 
over the angular range 0.1 < ^ < 2° is A = (7.45±0.02) x lO'^ and 7 = 1.757±0.002. These 
results are consistent with previous measurements (see, e.g., Gaztanaga 1994; Connolly et 
al. 2002; Frith, Outram, & Shanks 2005b). 

The hierarchical amplitudes sn{0) for N < 7 measured in magnitude ranges 18 < r < 19, 
19 < r < 20, 20 < r < 21, and 18 < r < 21 arc shown in Figure 9. The errors on the 
measurement of Sf^{9) arc calculated using Equations 14 and 15. Our results show that the 
amplitudes remain roughly constant over this angular range, although there is interesting 
structure. The differences between the magnitude ranges are more easily compared if one 
deprojects the sjv to obtain estimates of the real-space amplitudes, S'jv (see Equation 7). 
These amplitudes, plotted over the same magnitude ranges as the sn, are displayed in Figure 
10. For each magnitude range, the shape of S3 is remarkably similar. Its value is nearly 
constant until R reaches a scale of about 10 Mpc after which it declines sharply. 

For N > 3, the shape is not as consistent between the different magnitude bins, but 
strong similarities remain. Interestingly, the brightest magnitude bin has the most consis- 
tently constant amplitudes and the error bars are not significantly larger than those in the 
other magnitude bins (and in some cases, they are significantly smaller), despite having the 
fewest number of objects (~ 1 million galaxies in 18 < r < 19; ~ 2.75 million in 19 < r < 20; 
and ~ 7.5 million in 20 < r < 21). This is likely due to the fact that the brightest magnitude 
bin has the smallest spread in redshift and is therefore sampling the smallest volume. All 
four magnitude bins show a dramatic dip in the amplitudes between 1 and 5 Mpc, with 
subsequent flattening or increase in the calculated amplitude values. This feature is ampli- 
fied with increasing N such that ioi N — 7 the decrease is nearly an order of magnitude, 
while for A'^ = 4 it is closer to a 50% decrease. The fact that we see the same general features 
at the same physical scales (and thus different angular scales) suggests that the features we 
see are real and are not caused by our treatment of the data. 

We further quantify the degree to which the amplitudes are constant by assuming a 
relationship Sn = B^* 0°"^ and comparing aN to zero. Splitting the data for the magnitude 
range 18 < r < 21 into regions 0.2 < f < 2.0 and 2.0 < f < 10 Mpc and performing 
fits to the assumed power-law behavior gives the results displayed in Table 1. These 
results are not consistent with zero slope. Repeating for galaxies in the magnitude range 
18 < r < 19, one obtains the results in Table 2. The brighter galaxies have amplitudes that 
are more consistent with constant values. Although for both samples the general nature of 
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the amplitudes is a hierarchy of roughly constant Sn, there remain clear differences in the 
measured amplitudes between bright galaxies and the full sample. 

It is instructive to find the average values of the hierarchical amplitudes for comparison 
with previous studies and the predictions of Perturbation Theory. We perform fits to a 
constant value for both sn and Sn ior N < 7 between 0.2 and 10 Mpc in the 18 < r < 19 
and 18 < r < 21 magnitude ranges. These results are presented in Table 3. The values of 
Stv are in fair agreement with previous results determined for the APM Survey (Gaztaiiaga 
1994) and for the 2MASS extended source catalog (Frith, Outram, & Shanks 2005b), though 
the data taken from 18 < r < 19 are in better agreement, especially at lower order. One 
would not expect a perfect match between the results of different surveys given that it is 
clear that the amplitudes are not constant, different surveys will select different spectral 
types, and the different techniques were used to estimate the redshift distributions. 



5.2. N-point Correlations for Eeirly and Late-type Galetxies 

Strateva et al. (2001) provide simple color criteria that can be used to separate carly- 
(m — r > 2.2) and late-type {u — r < 2.2) galaxies from SDSS data. Using this color cut, we 
separate our sample into early- and late-type galaxies and repeat our N-point measurements 
for these two samples. It is important to note that the redshift distributions for early- 
and late-type galaxies in the magnitude range 18 < r < 21 are quite similar. The median 
redshift of the early-type galaxies is 0.361, while it is 0.374 for late-type. The fuU-width- 
half-maximums of the distributions are 0.35 and 0.39, respectively. It is therefore reasonable 
to expect that the two populations primarily sample the same epochs. Thus, we determine 
that direct comparison of the correlation functions determined for each galaxy type is fair. 

Figure 11 shows the results of the area- averaged correlation measurements for early- and 
late-type galaxies. The early-type galaxies clearly show stronger clustering at all angular 
scales, in agreement with previous results (e.g., Willmcr ct al. 1998; Zehavi et al. 2002; 
Norbcrg et al. 2002; Madgwick et al. 2003), and their correlation functions display a power- 
law behavior for all N to large angular scales. The best model fit for 1 — 7 over scales 
0.01 < 9 < 2° for early-types is -0.846 ± 0.002. The late-type galaxies, on the other hand, 
look quantitatively different. The magnitude of the slope of the correlation functions appears 
to decrease between 9 ~ 0.01 and ^ ~ 1.0, with the transition occurring around ^ ~ 0.1. The 
difference in slope at small angles relative to larger angles becomes even more pronounced 
for the high-order angular correlations. Determined over the same angular scales as the 
early-types, the best model fit for 1 — 7 for late-types is -0.677 ± 0.002 — significantly 
more shallow than that for early-types. 
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The transition between the small-angle and large-angle slopes of the late- type un results 
is smooth rather than abrupt, and actually appears as an inflection point at scales corre- 
sponding to about 1 h^^ Mpc, similar to the inflection point observed by Zehavi ct al. (2005). 
This inflection point is more clearly pronounced for latc-type galaxies, and the transition 
region itself becomes more pronounced as increases. In addition, the small-angle slope of 
the late-type galaxy correlation functions increases as the order of the correlation function 
itself increases. Conversely, the early-type galaxies do not show any dramatic change with 
increasing N . We discuss these observations in more detail in section 6.3. 

The hierarchical amplitudes sa; and Sn for early- and late-type galaxies with 18 < 
r < 21 are plotted in Figure 12. Interestingly, the error on individual points is larger on 
the measurement of the late-type correlation functions, despite the fact that we find more 
late-type galaxies (6,223,510) than early-type galaxies (4,948,448). This might imply greater 
cosmic variance in the clustering of those objects we classify as "late-type" . 

The early-type measurements show behavior that is quite similar to the measurement 
for all galaxies, (over 2 < f < 10 /i""^ Mpc, 0:3 = -0.23 ± 0.04 and 0:5 = -1.1 ± 0.1; over 
0.2 < f < 2.0 h-^ Mpc, = 0.07±0.02, and = 0.41 ±0.08). These values are quite close 
to those determined using all galaxy types, despite the fact that there is a large difference in 
the slopes of oJN.eariy aud iON.aii- At scalcs f > 2 h"^ Mpc the Sn for late-type galaxies are 
marginally consistent with being constant (e.g., 0:3 = — 0.12±0.07). At small scales, however 
(f < 2 h"^ Mpc), the arc highly inconsistent with being constant (e.g., «3 = — 0.33±0.04, 
tts = —4.0 ± 0.3). Table 4, respectively, present the host fit average values of s^r and Sn 
for early- and late-type galaxies in the range 1 < f < 10 h^^ Mpc. The average values of 
Sn and Sn for early-type galaxies are significantly higher than those for late- types, and the 
factor increases through order N — 7. 

Figure 13 shows the ratios of 153 and 6*4 for early- to late-type galaxies. Both ratios 
show significant structure. There is a maximum in both ratios at ~ 1 h^^ Mpc. At the 
smallest scales, SN,iate is larger than Sn, early Figure 12 shows that the small scale structure 
is largely dependent on the nature of SN,iate- Figure 13 shows that at scales larger than ~ 1 
h"^ Mpc, the ratios are roughly constant. The ratios of 5*3 are slightly less than two and the 
ratios of 5*4 are close to three at these scales. This startling difference between the grouping 
behavior of early- and late-type galaxies as a function of scale can be speculatively explained 
by several scenarios, which we will discuss in some depth in section 6.3. In general, our 
results trace higher-order biasing schema, and can be well described by differences in bias 
between galaxy types. We explore biasing constraints in the following section. 
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6. Discussion 

Our general results arc consistent with previous measurements (e.g., Gaztafiaga 1994; 
Szapudi et al. 2002; Croton et al. 2004; Frith, Outram, & Shanks 2005b). The measured Sn 
display the hierarchical scaling that one expects to observe for an initially Gaussian density 
field that has evolved due to gravitational instabilities. To properly constrain primordial 
Gaussianity, we would need significant measurements at scales where there is linear clustering 
(f > 10 Mpc). Since this is where errors begin to dominate our measurement, we do 
not place any strong constraints on primordial Gaussianity. We note that there are no 
structures in the measurements at scales greater than 4 Mpc that are consistent over 
all magnitude bins and all N. The are, however, consistent with constant amplitudes at 
the 2(7 level for all N and all magnitude bins, consistent with Gaussian initial conditions. 



6.1. First-Order Galaxy Bieis 

Galaxies are tracers of the underlying dark matter distribution. As a result there is no 
guarantee that galaxies cluster with the same amplitude as the underlying dark matter. At 
scales where the clustering is linear or quasi-linear, one can relate the over-density, Som, of 
dark matter halos to the measured over-density of galaxies, dg, by (Fry & Gaztanaga 1993): 

fc=0 

The first-order bias is related to the measured ^2 and the dark matter ^2 by 

^2,T = b\rp^2,DM (19) 

where the subscript T stands for the galaxy sample (either early, late, or all; all refers to 
magnitude bin 18 < r < 21 unless otherwise noted) used in the measurement. By using 
Equations 11-13, we can determine the relative first order bias (61) for each galaxy sample: 



Kt = (20) 

V J^o,DM / 

Galaxies, as a whole, have negligible first-order bias (e.g., Seljak et al. 2005; Gaztanaga et al. 
2005; Pan & Szapudi 2005). We thus take Ro,dm = Ro,aU and thereby determine 61 for early- 
and late-type galaxies. We find that bi^eariy — 1-36 ± 0.04 and 6i,/ate — 0.81 ± 0.01. This is 
consistent with findings that early- type galaxies cluster more strongly than late-types (e.g., 
Willmer et al. 1998; Zehavi et al. 2002; Norberg et al. 2002; Madgwick et al. 2003), but the 
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ratio of early- to late- type bias of ~ 1.75 is significantly larger than that found by Willmer 
et al. (1998) of ~ 1.2. The ratio of bi^eariy to bi^iate is largely independent of the assumption 
that bi^aii = 1 (a 50% change in Ro,dm results in a 6% change in the ratio). 

Assuming that bias terms bk are negligible for A; > 2, a simple relationship follows from 
Equation 18 that Sn,t — b\~r^ Sn,dm- Using this relationship and the average values of Sn 
for R between 1 and 10 Mpc (where both SN,eariy and SN,iate have nearly zero slope) 
and iV = 3, 4, 5, we calculate the ratio between bi^^ariy and bi^iate- The results for = 3, 4, 5 
are, respectively, 0.74 ± 0.11, 0.73 ± 0.22, and 0.50 ± 2.3. These are quite consistent with 
each other, but wildly inconsistent with both the results found from U2 and the notion that 
early-type galaxies cluster more strongly than late-type galaxies (e.g., Willmer et al. 1998; 
Zehavi et al. 2002; Norberg et al. 2002; Madgwick et al. 2003). Clearly, assuming higher- 
order bias terms are negligible leads to a startling contradiction. This inconsistency can be 
resolved only by incorporating higher order bias terms. 



6.2. Higher-Order Galcixy Bias 

Using Equation 18 a relationship can derived, valid on linear and quasi-linear scales, 
between the measured Sn,t, the Sn,dm, bi^T, and cn,t {cn = |f )• For N — 3, this is given 
by (Pry & Gaztanaga 1993) 

S3,T — bi^{S3^DM + 3c2,r) (21) 

and for A" = 4, 

Using perturbation theory Juszkicwicz et al. (1993) determined S^^dm to be f -(n + 3) by , 
where n is the slope of the matter power spectrum. Recently, several groups have converged 
to a canonical value of n = — 2 to within 5% (e.g., Percival et al. 2001; Tegmark et al. 2002; 
Cole et al. 2005); thus, we use n — —2 for our analysis. Since Equations 21 and 22 are 
valid only in the quasi-linear and linear regimes, and since error dominates our measurement 
at scales greater than 10 Mpc, we calculate the best fit average value of C2 in the 
range between 4 and 10 hr^ Mpc. We find C2,a«« = -0.24 ± 0.08, C2^eariy = 0.28 ± 0.10, 
and C2^iate = —0.70 ± 0.08. As a consistency check, we also measure C2^aii for galaxies in 
the magnitude bin 18 < r < 19, finding C2,aii = —0.31 ± 0.08. The two measurements are 
consistent. 

The result for all galaxies is entirely consistent with the result of Gaztanaga et al. 
(2005) of C2 = -0.36^[|:J^. If we arc to use the value of bi determined by Gaztanaga et 
al. (2005) of 0.94]);q^^, the resulting C2 is —0.30 ± 0.15, which is in remarkable agreement 
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with their previous measurement. Our C2 is slightly inconsistent with the results of Pan & 
Szapudi (2005) and Hikage et al. (2005), but as noted by Gaztanagaet al. (2005), differences 
between measurements should not be surprising given that they are conducted in different 
ways and/or use different galaxy catalogs. In fact, given our results that early- and late-type 
galaxies have different second order bias properties, measurements using different galaxy 
catalogs will likely result in different C2 measurements unless the fractions of early- and 
late-type galaxies are nearly identical in the two samples. 

The positive result for C2^eariy is consistent with the recent measurement of C2 = 0.57 ± 
0.33 made by Frith, Outram, & Shanks (2005b) and supports their assertion that their 
measurement of positive second order bias was likely due to the dominance of early-type 
galaxies in their 2MASS sample. The second-order bias we measure for late-type galaxies is 
significantly negative, and it is inconsistent with zero at 9a. 

Given the values of C2, one can go on to determine the significance of the third-order bias. 
Bernardeau (1994) finds that for n = -2, Sdm,4 = which results in c^^aii = 0.98 ±0.89, 

C3,eariy = 1.46±1.37 and cs^iate — 0.63±1.04. These are all positive, though all are consistent 
with zero at about the la level. It seems likely, based on these results, that given extremely 
precise measurements of Sn and analytic expressions for Sdm,n, that cn would be found to 
be non-neglibible for all N. Table 5 summarizes all of our bias measurement results. 



6.3. Early- and Late-type GalELxies 

In the previous section, we quantified the differences between the clustering behavior of 
our early-type and late- type galaxy samples. Figures 11 and 13, however, also can provide 
qualitative insights into the clustering differences between these two populations. In this 
section, we explore these two figures in more detail, focusing on the interpretation of Figure 11 
in light of the halo occupation distribution framework (HOD; see, e.g., Berlind & Weinberg 
2002), and on the interpretation of Figure 13 in terms of the interplay between star formation 
and local density. 

In Figure 11, we observe several important effects, previously detailed in Section 5.2. 
Specifically, these are that (1) early- type galaxies always cluster more strongly than late- 
type galaxies; (2) the slope of the late-type galaxy correlation function increases on smaller 
angular scales; (3) this change in slope is smooth, indicating an inflection point; and (4) both 
the small- angle slope and the width of the transition between the large- angle and small- angle 
late-type galaxy correlation functions increase for higher orders. Of these effects, the last 
three provide insight into the distribution of different galaxies within dark matter halos. 
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The shape of 0)2 for our early- and late-type galaxy samples appears to be consistent 
with the HOD determined by Zehavi et al. (2005) of red (early-type) galaxies and blue 
(late-type) galaxies. Zehavi et al. (2005) find that smallest mass halos (< lO^^h^^ Mq) are 
predominantly occupied by late-type galaxies, while larger mass halos (~ 3 x lO^^h^^ Mq) 
contain the majority of the early-type galaxies and also a large fraction of the latc-type 
galaxies. Thus, to model the late-type galaxy correlation function, a combination of one- 
halo and two-halo terms must be used, which naturally results a smooth transition between 
different correlation function slopes at small angles and large angles. 

We note, however, that our work differs from the Zehavi et al. (2005) work in two 
ways, which is reflected in Figure 11. First, by using a spectroscopic sample, Zehavi et 
al. (2005) were able to restrict their analyses to volume-hmited samples. Our analysis is 
magnitude-limited, on the other hand, and therefore suffers from projection effects. Our 
apparent magnitude limit of r = 21 can naively be visualized as a large volume containing 
intrinsically luminous sources and a smaller volume containing intrinsically fainter sources. 
Given the clustering dependence on luminosity (see, e.g., Zehavi et al. 2005, and references 
therein) the correlation function of these two samples would clearly be different. Since we 
observe their combination, we naturally see a broader transition region (e.g., compare our 
Figure 11 with Figure 22 from Zehavi et al. (2005)). 

Second, in a 2-point correlation function, only two halo terms need to be used, as 
two points can cither be in the same halo or in two different halos. As N increases, the 
number of allowable point configurations also increases (sec, e.g., Bernardeau et al. 2002); 
and these points can be distributed amongst one, two, or N halos. This increase in 
the number of configurations and in the number of dark matter halos in which they are 
distributed could explain the increase in the small-angle correlation function slope for late- 
type galaxies as N increases. We expect to address both of these issues in a future work, 
by using photometric redshifts to minimize projection effects and quantify luminosity effects 
on a; AT, and by performing an HOD analysis that includes additional halo terms for the 
higher-order correlation functions. 

Figure 13 is also very interesting. Naively, in the absence of higher-order biasing, one 
would interpret differences in the ratios of 5*3 and 5*4 between early- and late-type galaxies 
as tracing differences in the linear bias of these galaxy types. Under this paradigm, given 
that the first-order bias is inversely proportional to the hierarchical amplitudes (see, e.g.. 
Equation 19), we might conclude that we had merely rediscovered the morphology-density 
relation for galaxies (e.g.. Dressier 1980). As expected under such an interpretation (see, 
e.g.. Goto et al. 2003), we find, in clustered environments, that early- types are more biased 
than late-types out to some characteristic scale (around 0.5 h"^ Mpc), on which late-types 
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begin to dominate. However, as we demonstrated in the previous two subsections, our 
work suggests that it is extremely unhkely that higher-order bias terms are unimportant, 
particularly when comparing early- and late-type galaxies. 

As higher-order bias is important, one might consider an alternative explanation. The 
blue/red color cut we use should select late-type galaxies as those that are either actively 
star-forming, or have undergone recent star formation. It is therefore possible that our 
classification of a galaxy as late-type is correlated with the likelihood for a galaxy to be in 
an environment that is conducive to forming stars. Such unsettled environments should be 
enhanced by merger events that will be traced by tight groupings of galaxies. Therefore, on 
scales less than 0.5 h^^ Mpc we find S3 and 5*4 to be large for late-type galaxies, as their 
star-forming activity will be traced by compact, unusual configurations of galaxies. This is 
particularly true in comparison to early-type galaxies, which, as they are no longer forming 
stars, are in settled environments that one would not expect to be traced by mergers or tight 
groupings of galaxies. 

On scales larger than ~ 0.5 h"^ Mpc, Figure 13 suggests a transition towards early- 
type galaxies having more interesting configurations (i.e., at least 3-4 neighboring galaxies on 
these scales) than late-types. This can easily be interpreted, in an ensemble sense, as redder 
galaxies dominating highly- evolved, cluster-like environments, and bluer galaxies dominating 
void-like environments (e.g., Rojas et al. 2004). Thus, our work predicts that in groups of 
galaxies with characteristic scales < 0.5 h"^ Mpc one should expect merger activity that is 
triggering star formation, whereas groups with characteristic scales of > 0.5 Mpc should 
correlate with more relaxed cluster environments. In future work, we hope to fit Figure 
13 with models of the relative density profiles that we infer from the local environments of 
early- and late-type galaxies. 

7. Conclusions 

We have measured the area-averaged angular correlation functions, {N < 7), for 
over 11 million galaxies selected from the SDSS DR3 to have 18 < r < 21, by far the largest 
sample ever used to perform this type of measurement. Our large number of objects allowed 
us to split the full sample, allowing us to perform consistency checks and to inquire into 
the relationship between galaxy-type and clustering. Our data samples consisted of four 
magnitude bins (18 < r < 21, 18 < r < 19, 19 < r < 20, 20 < r < 21) and a sample of 
early- and late-type galaxies selected in the magnitude range 18 < r < 21. 

Using our measured (Djv, we calculated the projected hierarchical amplitudes, sjv. Through 
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the use of selection functions empirically determined from the photometric redshifts of galax- 
ies from SDSS DRl, we deprojected our sjv and u)2 measurements into the real-space ampli- 
tudes, S'iv, and the real space volume-averaged 2PCF, ,^2- With these values, we were able 
to test the Gaussianity of the primordial density field and measure galaxy bias as a function 
of galaxy type. 

We checked that the systematic effects of reddening and seeing were not major influences 
on our measurements. We found that reddening results in no appreciable effect, even on scales 
larger than 1°. To be safe, we introduced a reddening cut at Ag = 0.3 (~ Ar = 0.2), consistent 
with the mask suggested by Scranton ct al. (2002). Seeing introduces slight structure into 
the measurements of sn, but we found that using a seeing cut at I'.'S minimized the error on 
our Sn measurements. 

Our resulting measurements of share consistent amplitudes across each magnitude 
bin, though there are differences in their structure. The amplitudes in each magnitude 
bin are roughly constant at large scales, consistent with the hierarchical model predicted 
from a Gaussian primordial density field. Assuming that the first-order bias bi is equal to 
one when sampling all galaxies, we find significant second and third order bias terms of 
C2 = —0.24 ± 0.08 and C3 = 0.98 ± 0.89. The value of C2 is consistent with the findings 
of Gaztahaga et al. (2005), but slightly inconsistent with those of Hikagc ct al. (2005) and 
Pan & Szapudi (2005). Inconsistencies between results should not be surprising based on 
the differences in the samples and techniques used, especially if separate galaxy data sets 
sample different galaxy types. Our marginal result of non-negligible C3 is the first that has 
been reported using SDSS data. Based on this analysis, we postulate that it is likely that 
higher-order bias terms are also significant. 

There are startling differences between our measurements of using early-type galaxies 
and those using late-types. At the small scales (f < 1 h^^ Mpc), we find the ratio of 
SN,eariy to S^Me incrcascs with scale. Given that we find higher-order bias terms to be 
important for early- and late-type galaxies, we attribute this to the fact that late-type 
galaxies, as we select them, display evidence of recent star formation, and thus might be 
expected either to be undergoing, or have recently undergone, merger activity. These galaxies 
exist, therefore, in unsettled environments, traced by interesting overdensities of galaxies on 
scales of < 0.5 Mpc. Early-type galaxies, on the other hand, typically occupy more 
settled environments, characteristic of evolved and relaxed galaxy groups and clusters. Thus, 
we find that SN^eariy remains roughly constant as the scale gets smaller, while SN,iate shows 
a dramatic increase. 

The ratios SN,eariy to SN,iate are roughly constant at scales f < 1 h^^ Mpc. We find 
this can only be explained by large differences in both the linear and higher-order bias. This 
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result is independent of any assumption on the value of the first-order bias of all galaxies. If 
we assume that bi = 1 for all galaxies, we find the results summarized in Table 5. Our result 
that C2,eariy = 0.28 ± 0.10 is consisteut with Frith, Outram, & Shanks (2005b), suggesting 
that they were correct in asserting their result was due to the fact that the 2MASS galaxy 
catalog is dominated by early-type galaxies. 

Our results bode well for future measurements using SDSS data. Using the fifth data re- 
lease (DR5) of the SDSS, the total area will increase by nearly 70%, allowing for more precise 
measurements and also for greater precision on large scales, which should help tighten the 
constraints on the nature of the primordial density field. Also, DR5 will include photometric 
redshifts for each galaxy, which will aid in the deprojection process, allow one to study the 
dependence of clustering on luminosity, and allow one to explore any cosmic evolution in the 
hierarchical amplitudes. We anticipate that this information will aid in producing a clearer 
picture of the nature of galaxy bias, both for all galaxies and as a function of galaxy type. 
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A. From Connected Moments to N-point correlations 



The N-point correlations, ujn, can be derived from the connected moments, mjy via the 
connected central moment, /i^, as described in Gaztaiiaga (1994). Here, we present the 
relevant formulae ioi N < 7: 




(Al) 



and 



A«2 = rn2; A;2 = - n; 



(A2) 



1^3 = ms; k3 = Us - 3k2 - n; 



(A3) 



//4 = 7714 — 3m2; ki — jiA — 7k2 — Gks — n; 



(A4) 



A*5 = rrir, — 10m3m2; k^ = — 15/c2 — 25/^3 — IO/C4 — n; 



(A5) 



This preprint was prepared with the AAS macros v5.2. 
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A*6 = ^6 ~ 15777.4777.2 — 10772.3 + 30772.2; = /Iq — 31k2 — QOk^ — Gik^ — Ibk^ — n; (A6) 



fj,j — TTiy — 2 177i577i2 — 3577147713 + 21077137712; — //y — 63A;2 — 301A;3 — 350A;4 — 140A;5 — 21A;6 — 7i; 

(A7) 
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Table 1. Measured values for the exponent, a^-, and the amphtude aX 1 h ^ Mpc, Bm-, 
assuming power-law behavior for hierarchical amplitudes S'jv, N <7/\n. the bin 

18 < r < 21. 





0.2 < f < 2.0 h-^ Mpc 


2.0 < f < 10 h-^ Mpc 


N 










3 


0.065 ± 0.009 


4.66 ±0.06 


-0.31 ± 0.04 


c 1 c+0.19 
^■^^-0.24 


4 


0.037 ±0.036 


41.2 ±2.1 


-0.73 ±0.09 


67.4 ±7.0 


5 
6 
7 


0.36 ±0.03 
0.28l°i^ 


810 ±40 
(1.361°:^) X 10^ 
(3.3 ± 1.7) X 10^ 


-0.69l°il 
-0.27+°;- 
-0.871°:^° 


990^^?° 
(8.8l|^) X 10^ 
(3.1 ±2.3) X 10^ 



Table 2. Measured values for the exponent, a at, and the amphtude at 1 /i ^ Mpc, B^, 
assuming power-law behavior for hierarchical amplitudes S'jv, A?" < 7, in the bin 

18 < r < 19. 



0.2 < f < 2.0 h-^ Mpc 2.0 < f < 10 h'^ Mpc 



N ajsi Bn Q!jv Bn 



3 


0.03 ± 0.03 


3.86 ±0.03 


-0.13 ±0.05 


3.50 ±0.20 


4 


0.07 ±0.10 


34.8 ±2.7 


-0.22 ±0.15 


26.7 ±3.8 


5 




417 ±94 


-0 12+°-^^ 


258 ± 55 


6 


0.281^:1 


(5.32 ± 2.43) X 10^ 


'-'•'-"-'-0.50 


(2.68 ± 1.05) X 10^ 


7 




X 10^ 




(2.9 ±2.0) X 10^ 
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Table 3. Average sn and Sn taken for data between 0.2 and 10 h ^ Mpc for galaxies with 

18 < r < 19 and 18 < r < 21 



N 


18 < r 


< 21 


18 < r 


< 19 


Sn 


Sn 


Sn 


Sn 


3 


5.38 ± 0.03 


4.72 ± 0.03 


5.69 ± 0.08 


4.24 ± 0.06 


4 


67.4 ± 2.1 


49.9 ± 1.5 


66.4 ± 4.8 


34.8 ± 2.5 


5 


1040 ± 70 


635 ± 42 


1400 ± 90 


492 ± 32 


6 


(1.71 ± 0.21)xl0^ 


(8.51 ± 1.05)xl03 


(3.87 ± 0.16)xl0^ 


(8.90 ± 0.37)xl03 


7 


(4.44 ± 0.80)xl0^ 


(1.78 ± 0.32)xl05 


(8.38 ± 0.20)xl05 


(1.23± 0.29)xl0^ 



Table 4. Average sn and -^jv taken for data between 1 and 10 h ^ Mpc for early- and 

late-type galaxies with 18 < r < 21 



Early-type Late-type 



N 










3 


4.76 ± 0.11 


4.24 ± 0.10 


2.83 ± 0.19 


2.32 ± 0.16 


4 


47.6 ± 3.3 


36.6 ± 2.5 


22.5 ± 1.9 


14.4 ± 1.2 


5 


1780 ± 40 


1160 ± 30 


149 ± 78 


72.7 ± 39 


6 


(4.96 ± 0.13)xl0^ 


(2.68 ± 0.07)xl0^ 


(4.73 ± 1.47)xl03 


(1.73 ± 0.54)xl0=^ 


7 


(1.75 ± 0.64)xl06 


(7.78 ± 2.84)xl05 


(1.1 ± 4.6)xl05 


(0.3 ± 1.3)xl04 



Table 5. The determined values of first-, second-, and third-order bias for late-type, 
early-type, and all galaxies, assuming that bi = 1 for all galaxies. 



Sample 


h 


C2 






C3 




All, 18 < r < 21 


1 


-0.24 ± 


0.08 


0.98 


± 


0.89 


Ah, 18 < r < 19 


1 


-0.31 ± 


0.08 


1.23 


± 


0.94 


Early-type, 18 < r < 21 


1.36 ± 0.04 


0.30 ± 


0.10 


1.46 


± 


1.37 


Late-type, 18 < r < 21 


0.81 ± 0.03 


-0.71 ± 


0.07 


0.63 


± 


1.04 
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Fig. 1. — Normalized number of galaxies, n{z), in redshift bins Az — 10"^ determined from 
photometric redshifts of galaxies with 18 < r < 21. This curve defines the redshift selection 
function wc use in our dcprojcctions from angular to real-space for hierarchical amplitudes 
the magnitude bin 18 < r < 21. Similar curves are constructed for the other magnitude 
bins. 
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Fig. 2. — The probability of having counts in a cell for resolutions corresponding to 
the angular scales 0.02°,0.08°,0.16°,0.3°,0.6°,1.3°,2.6°, and 5.2° (from left to right). The 
discretization in these large angle curves indicates make precise measurements at scales 
larger than a about a degree. 
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Fig. 3. — The S3 hierarchical amphtude for 18 < r < 21 galaxies with g-hand reddening cuts 
at 0.35 (open circles); 0.3 (open squares); 0.25 (filled circles); 0.2 (filled squares); 0.15 (filled 
triangles) . The 9 values have been shifted for each seeing cut for clarity. We see little change 
in the amplitudes as a function of the reddening cut. 
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Fig. 4. — The S3 hierarchical ampHtude at 1.3° versus the gr-band reddening cut that is 
apphed. We determine the preferred cut to be at = 0.3. 
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Fig. 5. — The S3 hierarchical amphtudes of 18 < r < 21 galaxies with seeing cuts at 1'.'4 (open 
circles); 1'.'5 (open squares); I'.'G (open triangles); 1'.'7 (filledcircles) ; I'.'Q (filled squares); and 
2'.'2 (filled triangles). The 9 vahics have been shifted for each seeing cut for clarity. Seeing 
does introduce minor structure into our measurements, at a level below our errors. The error 
on the measurement is minimized at our preferred value of 1'.'5 for most scales. 
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Fig. 6. — Galaxy-seeing cross-correlations with seeing cuts at 1'.'4 (green, open circles); I'.'S 
(blue, open squares); 1'.'6 (red, open triangles); 1'.'7 (magenta, filled circles); I'.'S (cyan, filled 
squares); and 1'.'9 (black, filled triangles) for 18 < r < 21 galaxies. The 9 values have 
been shifted slightly for each seeing cut for clarity. The cross-correlation for the cut at out 
preferred seeing cut of I'.'S is the most consistent with zero. 
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Fig. 7. — The values uj3{9) and >S3(^) for each SDSS data stripe. There is minimal variation 
between stripes and no one stripe dominates the measurements. 
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Fig. 8. — The area-averaged angular N-point correlation functions for 18 < r < 21 galaxies 

for N — 2, ,7 (left to right, top to bottom). Note the logarithmic scaling. The power-law 

behavior is observed for all N, consistent with the hierarchical model. The open triangles 
designate data calculated using the striped method, while solid triangles represent the data 
calculated using the SDSSpix method. There is no measurement using the SDSSpix method 
at 0.12 degrees because this scale is not an integer multiple of its base resolution scale (0.08 
degrees) . 
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Fig. 9. — The hierarchical amphtudes S3 through S7 for galaxies with 18 < r < 19 (top left), 
19 < r < 20 (bottom left), 20 < r < 21 (top right), and 18 < r < 21 (bottom right). The 9 
values for each amplitude have been shifted and data with extremely large errors are omitted 
for clarity. The amplitudes are roughly constant, consistent with the hierarchical model. 
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Fig. 10. — The hierarchical amphtudes 5*3 through 5*7 for galaxies with 18 < r < 19 (top 
left), 19 < r < 20 (bottom left), 20 < r < 21 (top right), and 18 < r < 21 (bottom right). 
The f values for each amplitude have been shifted and data with extremely large errors are 
omitted for clarity. The amplitudes are roughly constant, consistent with the hierarchical 
model. 
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Fig. 11. — The area-averaged angular N-point correlation functions for 18 < r < 21 early- 
type (red, triangles) and late-type (blue, squares) galaxies, for = 2, ,7 (left to right, top 

to bottom). Note the logarithmic scaling, and that the 9 values of the early- type data have 
been shifted to slightly lower values for clarity. Clear differences between the two galaxy 
types are evident in the slopes and amplitudes. 
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Fig. 12. — The hierarchical amphtudes S3 through sj (bottom) and 5*3 through S7 (top) for 
late-type (left) and early-type (right) galaxies. The 9 values for S4 and sj have been shifted 
and data with extremely large errors are omitted for clarity. At large scales, the early-types 
have significantly larger amplitudes than the late-types, implying a significant difference in 
bias between the two samples. 
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Fig. 13.— The ratios of S^^eariy to Ss^iate (blue, triangles) and S4,eariy to S4,iate (red, squares) 
for galaxies in the magnitude range 18 < r < 21. Data with large errors are omitted for 
clarity. These at large scales, the ratios indicate that there is a significant difference in bias 
between early- and late-type galaxies. The ratios at small scales may be indicative of recent 
merger activities in late-type galaxies. 



